CDS
Accession Number | TCMCG004C41659 |
gbkey | CDS |
Protein Id | XP_025620403.1 |
Location | join(111787962..111787994,111788086..111788166,111788259..111788594,111788672..111788842,111788939..111789015,111790252..111790321,111791762..111791814,111791905..111791962,111792065..111792414,111792504..111792576,111792665..111792856,111792938..111793006,111793129..111793206,111793455..111793541,111793629..111793663,111793776..111793881,111794255..111794335,111794417..111794509,111794579..111794632,111794731..111794847,111795018..111795116,111795357..111795404,111795480..111795566,111795697..111795804,111795903..111795967,111796189..111796291,111796381..111796629,111796733..111796867) |
Gene | LOC112711888 |
GeneID | 112711888 |
Organism | Arachis hypogaea |
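The Location field above uses the GenBank `join(start..end,...)` notation, where each pair is one coding segment with 1-based, inclusive coordinates. The following is a minimal sketch of how the spliced CDS length can be derived from it. The helper names `parse_join` and `spliced_length` are hypothetical, and the parser assumes a plain forward-strand `join(...)` with no `complement(...)` wrapper or partial markers (`<`, `>`).

```python
import re

# Location string copied from the record's "Location" field.
LOCATION = (
    "join(111787962..111787994,111788086..111788166,111788259..111788594,"
    "111788672..111788842,111788939..111789015,111790252..111790321,"
    "111791762..111791814,111791905..111791962,111792065..111792414,"
    "111792504..111792576,111792665..111792856,111792938..111793006,"
    "111793129..111793206,111793455..111793541,111793629..111793663,"
    "111793776..111793881,111794255..111794335,111794417..111794509,"
    "111794579..111794632,111794731..111794847,111795018..111795116,"
    "111795357..111795404,111795480..111795566,111795697..111795804,"
    "111795903..111795967,111796189..111796291,111796381..111796629,"
    "111796733..111796867)"
)


def parse_join(location):
    """Return a list of (start, end) integer pairs found in the location."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(segments):
    """Sum segment lengths; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
total = spliced_length(segments)

# Number of segments, nucleotides, and codons excluding the stop codon.
print(len(segments), total, total // 3 - 1)  # prints: 28 3108 1035
```

The 28 segments sum to 3108 nt, that is 1036 codons: 1035 residues plus one stop codon, which agrees with the protein Length of 1035aa given in this record.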
Protein
Length | 1035aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA476953 |
db_source | XM_025764618.1 |
Definition | pre-mRNA-processing protein 40A [Arachis hypogaea] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGTCCAATAATCCGCAATATCCGGGTTTACAGCCTCTTCGGCCTCCTGGACCTCCCATTGCTGGTTCACTGGATCCTCAGCGCAGTTTTGTTCCACCTCCCATGCCTGGTCAATATCGCCCTGTAGTTCCTACTCAGCAGCCTCAACAGTTCATGCCCATGCCATCTCAACATTACCCTCCTGTTGGTCCCAGTGGTCCCATGATGAATGTTGGAATGCCTCCTCAAAATCAACAGCCCCAATTTCCTCAACCCATTCAACAGTTACCTCCGAGATCTGGCCAACAATTGCAACTCCCACCACAGCCGCAGCCACTTCCATTGTCAGTTGCTCGGCCAAACATGCACATGACATCTGAATCAATGATGCCGCAGCCTGATTCTCAAGTGCCAAATGGCTATGCACCAAGCTTAGGTGGCCCGGGAATGCCCCTTCCAGCATCATATACGTTTGGACCATCTTCTTATGGTCAAGTGCAAACTAATTTTAGCTCTGCTAGCCAATTCCACCCTGCTTCTCAAATCCAAGCTCCCTCTAGTTCTTCTTCCCAGAGCATTACCTCTGATACAGTTGTTCTGAGCAATGACGAGAAACCTTCAACTACATCTGTCACGCCTTCAGCAACTAGCATCCAGCCTTCACTTGCTAATGGTGGATCCACAGATTGGATTGAGCATACTTCTTCTAATGGAGTAAGATATTACTACAATAAGAAGACTAAAGTATCTAGCTGGGAGAGGCCTTTCGAATTGATGACCCCAATTGAGAGGGTGGATGCAACAACAAACTGGAAGGAGTATACTAGTCCTAATGGAATAAAGTATTACTACAACAAGGTCACTAGGGAATCAAAGTGGATGATTCCTGAGGAACTGAAGTTGGCCCGTCAGCAAGTTGAAAAGGCAGTTGCCAATGGAACACATACTGATGCTCTACCGAATTCTCATACTCAACCGTCTGTAACTCCTCCTGTGATTGAAACAGCACCAACTGCAGCAGCTAATTCATCTTTGATTGGTCAAGGGGAACCATCAAGTCCTGTTTCAGTTGCTCCTGTTGTTAGTGCATCTACAAGTCATCCACAATCTGAGATGAGTTCTGGACCATCTGCCTCTCCTCATGTGGCTCCCATAACTGGAATGGCAGTGGCTGAAGTAGAGTTACCAGTAAATACTGCTACAATATCTGATGCTGCAGCAGGAAGTGATAGAGCTTCTGTTACCAATCAGAATGATGGCAACAACTTTCTGGTGAAGGATACACTGGGCTCTGCAGATGAAGTTCCAGCAGAAGATAAAGAAGATGGTAAAAATGATTCATTAGTAGAAAAAACAAATGATGTGGCTTCAGAAACAAAGGCAGATACGGTTTCAGAAACAAGGGCTGATTTGGCTTCAGAAACACAGGCAAATGTGGCTTTGGAAACAAAGGCAGATGCATCTTCAGAAACAAAAACACGTGAACCATTACCCTTAGTTTATGCAAATAAGATGGAGGCTAAAGAAGCATTCAAAGCACTGATAGAGTCTGTAAATGTTGGATCTGACTGGACATGGGATCGAACTATGCGATTAATAGTTAATGACAAAAGGTACGGTGCATTAAAATCGCTTGGAGAAAAGAAGCAAGCTTTCAATGAGTACTTAAGTCACAGGAAAAAACAGGAAGCGGAAGAAAAGCGCATGAAGCATAAAAAAGCACGAGAGGATTTTAAAAAGATGTTAGAAGAGTCCACGGAGTTGAATCCATCCACTAGATGGAGCAAAGCCGTGACAATATTTGAAAATGATGAACGTTTCAAGGCTGTTGAGCGTGACAGAGATCGCAGGGATATGTTTGATAGTTTCTTGGAGGAACTTATAAACAAGGAACGAGCAAAGGCTCAAGAAGAACGGAAGAGGAATATAACAGAGTACAGGAAGCTTTTAGAATCTTGTGACTTTATAAAAGCTAACACACAGTGGCGAAAAGTTCAAGACCGCTTAGAGGCTGAC
GAAAGATGTTCGCGTCTTGAGAAAATTGACCGCTTGGAAATATTCCAGGACTATTTACGTGATTTAGAGAAGGAAGAGGAAGAGCAGAAGAAGTTACTAAAGGAGGAATTGAGAAAGACAGAACGTAAAAACCGTGATGAATTCCGCAAATTGATGGAAGAGCATGTTGCTGCTGGCATTCTTACAGCAAAAACTCATTGGCGTGATTATCACATGAAGGTGAAAGATTTACCTGCATATCTGGCAGTGGCATCAAACACATCAGGTTCAACTGCAAAAGACTTATTTGAAGATGTTGCTGAAGAGCTAGAGAAACAATATCATGATGAAAAGAGTCGAATTAAGGATGCAGTGAAGTTGGCTAAGATAACATGGTCATCAACCTATACTTTTGAAGAGTTCAAATCAGCTTTATCTATTGACTCTCCTCCAATATCTGATTTTAACTTAAAGCTAGTGTTTGATGAGCTACTAGAGCGGGCTAAGGAGAAGGAAGAAAAGGAGGCCAAAAAACGGAAACGTCTAGCAGATGATTTCCTTCATTTACTATATTCTACTAAGGACATTACTGCATCTTCAAAATGGGAAGATTGTATAACACTTATTGAAGATAGTCAAGAGTTCAGATCTGTTGGAGATGATAACCGTTGCAAGGAAATATTTGAGGAGTACATTACACAACTGAAAGAACAGGCAAAAGAGGGTGAGCGGAAACGGAAAGAGGAGAGGGCAAAGAAGGAAAAGGATAGGGAGGAAAAAGAAAGACGAAAATCTAAGCAAAGAAGGGAAAAAGAAGGAGTTCGCGAAAGAGAGAAAGATAAAGCAGACAGTGACAGTGCCGACTTAACGGAGAAAGGCGATAGTAAAAACAAACGGAGGCAGCATCAAAGTCCTGAGCACACTTCTCATGAATTGGATAAAGAGAGGAGTAAGAAATCTCATGGGCATAGTAGCAGCGACAGGAAGAAATCGAAACGACATTCATCTGGTCATGAATCAGATGAAGGCCGGCATAAAAGACACAAGCGCGACCACCGCGGTGATCCTCACAGAGAAGGCGGTTATGCGGAGGCGGAAGACGATGACTATGGTAAAGATGTTGATAGATGGTAA |
Protein: MSNNPQYPGLQPLRPPGPPIAGSLDPQRSFVPPPMPGQYRPVVPTQQPQQFMPMPSQHYPPVGPSGPMMNVGMPPQNQQPQFPQPIQQLPPRSGQQLQLPPQPQPLPLSVARPNMHMTSESMMPQPDSQVPNGYAPSLGGPGMPLPASYTFGPSSYGQVQTNFSSASQFHPASQIQAPSSSSSQSITSDTVVLSNDEKPSTTSVTPSATSIQPSLANGGSTDWIEHTSSNGVRYYYNKKTKVSSWERPFELMTPIERVDATTNWKEYTSPNGIKYYYNKVTRESKWMIPEELKLARQQVEKAVANGTHTDALPNSHTQPSVTPPVIETAPTAAANSSLIGQGEPSSPVSVAPVVSASTSHPQSEMSSGPSASPHVAPITGMAVAEVELPVNTATISDAAAGSDRASVTNQNDGNNFLVKDTLGSADEVPAEDKEDGKNDSLVEKTNDVASETKADTVSETRADLASETQANVALETKADASSETKTREPLPLVYANKMEAKEAFKALIESVNVGSDWTWDRTMRLIVNDKRYGALKSLGEKKQAFNEYLSHRKKQEAEEKRMKHKKAREDFKKMLEESTELNPSTRWSKAVTIFENDERFKAVERDRDRRDMFDSFLEELINKERAKAQEERKRNITEYRKLLESCDFIKANTQWRKVQDRLEADERCSRLEKIDRLEIFQDYLRDLEKEEEEQKKLLKEELRKTERKNRDEFRKLMEEHVAAGILTAKTHWRDYHMKVKDLPAYLAVASNTSGSTAKDLFEDVAEELEKQYHDEKSRIKDAVKLAKITWSSTYTFEEFKSALSIDSPPISDFNLKLVFDELLERAKEKEEKEAKKRKRLADDFLHLLYSTKDITASSKWEDCITLIEDSQEFRSVGDDNRCKEIFEEYITQLKEQAKEGERKRKEERAKKEKDREEKERRKSKQRREKEGVREREKDKADSDSADLTEKGDSKNKRRQHQSPEHTSHELDKERSKKSHGHSSSDRKKSKRHSSGHESDEGRHKRHKRDHRGDPHREGGYAEAEDDDYGKDVDRW |